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Uniform endomorphisms which are 
isomorphic to a Bernoulli shift 

By Christopher Hoffman and Daniel Rudolph 
Abstract 

A uniformly p- to- one endomorphism is a measure-preserving map with 
entropy log p which is almost everywhere p-to-one and for which the condi- 
tional expectation of each preimage is precisely 1/p. The standard example 
of this is a one-sided p-shift with uniform i.i.d. Bernoulli measure. We give 
a characterization of those uniformly finite-to-one endomorphisms conjugate 
to this standard example by a condition on the past tree of names which is 
analogous to very weakly Bernoulli or loosely Bernoulli. As a consequence we 
show that a large class of isometric extensions of the standard example are 
conjugate to it. 



1. Introduction 

Perhaps the most significant aspect of Ornstein's isomorphism theory for 
Bernoulli shifts [5] is the very weak Bernoulli condition which Ornstein and 
Weiss proved characterizes isomorphism to a Bernoulli shift [6], [8]. The very 
weak Bernoulli (v.w.B.) condition has been exploited to show that many classes 
of transformations are isomorphic to Bernoulli shifts. Examples of these in- 
clude ergodic toral automorphisms and geodesic flows on negatively curved 
space [4], [9]. The Kakutani equivalence theory of Feldman, Ornstein and 
Weiss has a similar condition, loosely Bernoulli (I.B.), which shows when a 
transformation is Kakutani equivalent to a Bernoulli shift [7]. The existence 
of such a criterion is the hallmark of these and parallel theories. 

We consider here uniformly p- to- one endomorphisms, measure-preserving 
endomorphisms with entropy logp which are a.s. p-to-one and for which the 
conditional expectations of the preimages are all equal to 1/p. The one-sided 
Bernoulli shift on p symbols, each equally likely, is the standard example of a 
uniformly p-to-one endomorphism. This endomorphism has state space B = 
{0, 1, . . . ,p — 1} N . The measure v is defined by 



v(b | b = ao, b% = ai, . . . ,b m = a m ) = 1/p 



rn+l 
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for any sequence ciq, . . . ,a m where all the ai 6 {0, 1, ... ,p — 1}. The action 
on B is the left shift a(b)i = bi+\. In this paper we give a characterization of 
those uniformly p-to-one endomorphisms conjugate to the standard example 
via a criterion parallel to v.w.B. and l.B. 

We now describe the class of uniformly two-to-one endomorphisms known 
as the [T, Id] examples. Let (B,a,u) be the standard two-to-one endomor- 
phism. For Y any Lebesgue space and T an ergodic automorphism of Y, 
[T, Id] is defined by 

[T,ld](x,y) = (a(x),T^(y)). 

Parry [10] has shown that for T, an irrational rotation of the circle extremely 
well approximated by rationals, [T, Id] is always conjugate to the standard 
example (a itself). In Section 6 we will show that [T, Id] is conjugate to the 
standard example for all ergodic isometrics T. 

The rest of the paper is organized as follows. In Section 2 we define tree 
very weak Bernoulli and prove that a large natural class of factors of tree v.w.B. 
endomorphisms are tree v.w.B. In Section 3 we will introduce the notion of a 
one-sided joining of two endomorphisms. In Section 4 we introduce the concept 
of tree finitely determined and prove that tree very weak Bernoulli implies tree 
finitely determined. Section 5 contains the bulk of the proof. It will follow the 
same outline as the Burton-Rothstein approach to Ornstein's theorem. First 
there will be a Rokhlin lemma and a strong Rokhlin lemma for endomorphisms. 
This is followed by the copying lemma, which is the main tool in the proof. 
The proof of our copying lemma is much easier than the proof of the copying 
lemma in Ornstein's theorem because, perhaps surprisingly, we will not have 
to deal with entropy. In Section 6 we apply the tree v.w.B. characterization to 
show that a large class of isometric extensions of the standard endomorphism 
are one-sidedly conjugate to it. 

The methods of this work are much more broadly applicable than to just 
uniformly finite-to-one endomorphisms. For any bounded finite-to-one endo- 
morphism (X, T, fi) one can assign to each point x a tree of inverse images. 
We will discuss this in great detail as we continue. Each point on this tree 
can be weighted with the expectation that the path of inverse images to that 
point has occurred. This assignment of a weighted tree to x is a conjugacy 
invariant of the endomorphism. Moreover, the expectations of just the inverse 
images T~ 1 (x) set a natural lower bound on the entropy of T. The uniformly 
finite-to-one maps are the simplest bounded finite-to-one endomorphisms in 
that the weighted trees do not depend on x and the entropy of T is at its mini- 
mum given the tree. To apply our methods to the general bounded finite-to-one 
case requires working relative to the sub-cr-algebra generated by the trees, with 
perhaps nonzero entropy relative to the trees. Both issues are standard in the 
theory and require no essentially new ideas. We have focused on the simplest 
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case here as it allows the presentation of all the new ideas without the extra 
baggage and also as it arises in a variety of natural situations, for example 
rational maps of the Riemann sphere and ergodic group endomorphisms. All 
of these issues will be presented separately. 

Isomorphism theories for endomorphisms have been presented previously. 
The two cases we are aware of are del Junco [2] and Ashley, Marcus and Tuncel 
[1]. Del Junco considers two-sided Bernoulli shifts and is after conjugacies that 
are Unitary, one-sided in one direction but two-sided in the other. Thus his 
work is significantly different from ours. It is interesting to note his use of 
what he calls a * joining which is strongly analogous to our one-sided joinings. 
The work of Ashley, Marcus and Tuncel is both more restrictive and more 
general than ours as they consider all finite state Markov chains. As our 
examples indicate, many standard endomorphisms are not directly Markov. 
On the other hand, the existence of a period in a Markov chain will make it 
nonstandard. We expect future work on relativizing our methods will provide 
an alternative approach to this classification of one-sided Markov chains just as 
relativizing Ornstein's theorem provided a classification of two-sided Markov 
chains. 

The authors wish to thank A. del Junco for discovering several gaps and 
one serious error in the previous version of this paper. 



2. Tree very weak Bernoulli 

In this section we will define what it means for an endomorphism to be 
tree very weakly Bernoulli. As with v.w.B. and I.B., our criterion concerns 
names on orbits. A p-to-1 uniform endomorphism T and a point x generate a 
p-ary tree of inverse images. We will describe how to define a tree name given 
the tree of inverse images and any partition or, more generally, any metric 
space- valued function on X. Simply stated the tree very weakly Bernoulli 
criterion says that for any such function and a.e. two points x and x' one can 
match these two tree names with an arbitrarily small density of errors by a 
map that preserves the tree structure. 

Consider a p-ary tree with p n nodes at each index n > 0. Each node at 
index n connects to p nodes at the index n+1. We assign each such set of p 
nodes a distinct value in {0, . . . ,p — 1}. Then we label each node other than 
the root by the sequence of values we see moving from the root to the node. 
In this form we can concatenate nodes v' and v by concatenating their labels. 
Notice that when we fix a node v, the set of labels vv' form a p-ary subtree 
rooted at node v. This is consistent with our convention that the root node is 
unlabeled. 
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Call this labeled tree T and call the tree that has all the nodes at index 
less than or equal to n, T n . Let 77 be the set of nodes of T and % be the set 
of nodes for T n . For v G 77 and at index i (i.e. v G rji \ we write \v\ = i 
and where v is a list of values v±, . . . ,Vi from {0, . . . ,p — 1} which is the list of 
labels of the nodes along the branch from the root to v. 

Let A be the collection of all bijections of the nodes of T that preserve 
the tree structure. We refer to this as the group of tree automorphisms. Let 
A n be the bijections of the nodes of T n preserving the tree structure. To give 
a representation to such automorphisms A notice that from A we obtain a 
permutation ir v of {0, . . . ,p — 1} at each node, giving the rearrangement of its 
p predecessors. An automorphism of T n will be represented by an assignment 
of a permutation of {0, ... ,p — 1} to each node of the tree except for those at 
index n. 

Let (X, T, /j, T) be a uniformly p-to-one endomorphism. Then each i£l 
has p inverse images. Select a measurable p set partition K of X such that 
almost every x has one preimage in each element of K. Label the sets of K 
as Kq, Ki, . . . , Kp—i. For each i 6 {0, . . . ,p — 1} and x £ X define Tj(x) to be 
the preimage of x in K^. We now define a set of partial inverses for T. For 
v = (vi, ... ,Vi) £ n define T v (x) = T Vi (- ■ ■ (T Vl (x))). 

We let R and U be compact metric spaces. We will use d generically for a 
metric on R and U and more generally for any metric space considered, and will 
always assume that the labeling spaces R and U have d diameter precisely 1. 

A T,R name h is any function from T to R. We say it is tree adapted if 
for any v £ 77 and z, j G {0, ... ,p — 1} with i ^ j we have 7^ and 

we say it is strongly tree adapted if d(h{vi),h{vj)) = 1. A map / : X — > i? 
generates T, i? names by T x (v) = f(T v (x)). We say that / is (strongly) tree 
adapted if 7^ (v) is (strongly) tree adapted for almost every x. Let Q be the 
cj-algebra on X generated by pullback of the a-algebra on R. We say / gen- 
erates if ViT~ l (Q) = T. "Strongly tree adapted" depends explicitly on the 
choice of metric but obviously, tree adapted, does not. For any / : X — ► R, the 
map / V K : X — ► R x {0, . . . , p — 1} will always be tree strongly tree adapted. 
More generally so long as d(h(vi), h(vj)) is bounded uniformly away from zero 
by some a > we can replace d by an equivalent metric of diameter 1 making 
h strongly tree adapted. Just set 

d new (x,y) = min(l,d(x,y)//3) 

for some < (3 < a. 

Any measure-preserving endomorphism (X, T, /j) and function / : X — > R 
generate a stationary sequence of random variables R4 = f o T l ,i > 0. We 
regard such a sequence as a measure on i? N with the weak* topology. There 
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is a unique extension of this measure to all of Br, preserving stationarity. For 
a.e. x € X let {i?«(x)}j<o be random variables with distribution Dist(i?j,i < 
0\B i = f(T i (x)),i>0). 

Any tree adapted T, i?-name h, generates an B- valued sequence of ran- 
dom variables {-Ri(7t)}i<o- To a cylinder set r_j, r_j+i, . . . , r_i we assign the 
measure equal to p _J times the number of all nodes at index j whose name to 
index 1 is the word r_j, . . . , r_i. For (A, T, p) a uniform p-to-one endomor- 
phism and / : X — ► B tree adapted and generating and T x the T, i?-name of 
x, {Bi(x)}i < o and {-Rj('7i)}i<o are just two descriptions of the same sequence 
of random variables. 

We now put a family of metrics on T, i?-names (and on T n , i?-names). For 
two T, .R-names h and h! we define 

t n (h,ti)= min - ]T p-^d(h(v),h'(A(v))). 

0<\v\<n 

For the two names h and /i' in this definition, if one follows each branch 
from the root and through the tree and writes down the name seen along that 
branch, one obtains p n different names. Giving each name a mass of p~ n one 
obtains the two sequences of random variables Bi(h) and Bi(h'), —n < i < 0. 
A matching of the trees via a tree automorphism gives a coupling of these 
two distributions. The weighting of nodes is such that the calculation of the 
t distance is precisely the d distance between the random variables one would 
calculate from this coupling. In this sense t is at least as large as d, which would 
be the inf over all couplings, not just those that come from tree automorphisms. 

Definition 2.1. Let (X, T, p) be a uniform p-to-one endomorphism and 
/ a tree adapted map from X — > B. We say (X, T, /j.) and / are tree very weak 
Bernoulli (tree v.w.B.) if for any e > and all N sufficiently large there is a 
set G = G(e, N) with > 1 — e such that for any x,y G G, 

tN(T x ,T y ) < e. 

It is fairly direct that tree v.w.B. endomorphisms are exact and hence 
always ergodic [3], [13]. Our next goal is to show that a large natural class of 
factors of tree v.w.B. endomorphisms is again tree v.w.B. First we define the 
class we are interested in. 

Definition 2.2. We say a factor map (f> from (X, T, /j,) to (Y, S, v) is tree 
adapted if for a.e. point x the map <fi restricted to T~ l (x) is one-to-one into 

s-\4>{x))- 

It is not difficult to construct factor maps between endomorphisms that 
are not tree adapted. On the other hand a conjugacy is clearly tree adapted 
in both directions. Our constructions will never leave the class of tree adapted 
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factor maps. We now give a useful little technical lemma that tree adapted 
factors of uniform p-to-one endomorphisms are themselves uniform p-to-one 
endomorphisms . 

Lemma 2.3. Suppose (X,T,/j,) is a uniform p-to- one endomorphism and 
(Y, S, v) is a tree adapted factor of {X, T, jjl) by a map <p. Then <fi restricted to 
T _1 (x) is almost surely onto S~ 1 ((f)(x)) and (Y, S, v) is also a uniform p-to- one 
endomorphism. 

Proof. Because the map (f> is tree adapted the endomorphism Y is a.e. at 
least p-to-one. For each x £ </> -1 (y), x' G T~ l (x) and (fi(x') = y' the conditional 
probability of x' given x is the same as the conditional probability of y' given x 
and y (i.e. 1/p). Thus the entropy is at least log p. As the entropy of X is log 
p the entropy of Y can be at most log p. Thus the conditional probability of y' 
given only y must be 1 /p and (Y, S, v) is a uniform p-to-one endomorphism. □ 

Corollary 2.4. For (X,T,n) a uniform p-to- one endomorphism, R 
compact metric and f : X — > R tree adapted, the left shift on the sequence of 
random variables Ri = f oT ! is a uniform p-to- one endomorphism. 

Lemma 2.5. Suppose (X,T,fi) is a uniform p-to- one endomorphism, f is 
tree adapted and generating, (X,T,fi) and f are tree v.w.B. and (Y,S,v) is a 
factor by a tree adapted factor map 4>. Then for any tree adapted g : Y — > R, 
(Y, S, v) and g are also tree v.w.B. 

Proof. By Lemma 2.3, we know (Y, S, v) is a uniform p-to-one endomor- 
phism. Let Q be the a-algebra of / measurable sets; hence go <p is V^o 
measurable. Writing this as a finite approximation, for each e > we can find 
an s G N, a 5 > and a subset G'CI with /J.(G) > 1 — e so that if x, x' G G' 
and 

d{f{T i {x))J{T i (x'))) < 5 for < i < s then 
d(g( ( f ) (x)),g( ( t > (x > )))<s. 
As T is ergodic the mean ergodic theorem tells us that 

\ E P~ lvl XG>(T v (x)) A^(G') 

0<\v\<n 

in L 1 . (Convergence here can be shown to be pointwise but as this is not 
standard and we do not need it we just quote the mean convergence which 
follows directly from rewriting this average as an average over the forward 
images of T~ n (x).) 

Suppose x, x' and N satisfy 

5 2 

tN{%,%>) < — , 
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^ E p- lvl XG>(T v (x))>l-2e 

0<\v\<N 

and 

E p- lvl XG>(T v (x>))>l-2e. 

0<\v\<N 

Then there is an A G -4 ra given by 

1 E p- M d(/(r,(x)),/(r A( , ) (x')))<^. 

JV 0<M<iV zs 

Let Z = {v\0 < \v\ < N^ifiT* o T w (x)), /(T* o T^x'))) < <5} and 
conclude 

If v £ Z and both T v (x) and TV^x') are in G' then 

d(# o (j)(T v (x)),go 4>(T A{v) {x'))) < e and hence 

1 E p-^d(g{S v {ct ) {x))),9(SA {v Mx'))))<Se + 5. 

0<\v\<N 

That (Y, S, v) is tree v.w.B. now follows. □ 

3. One-sided couplings 

As we are following the Burton-Rothstein approach to the isomorphism 
theorem we will be considering joinings of two endomorphisms (X, T, jj) and 
(Y, S, v) . A coupling of two spaces (X, /j,) and (Y, is a measure on X x Y 
which has marginals fi and ia A joining of (X, T, /u) and (Y, S 1 , /x) is a coupling 
of (X, //) and (Y, zv) which is invariant under T x S. We will not consider all 
of the joinings of (X, jj) and (Y, v) but rather a collection we call one-sided 
joinings. In this section we define one-sided joinings and prove some facts 
about them. 

Definition 3.1. Suppose S is a subset of Z and {i?j}s and {£/j}s are two 
sequences of random variables. A one-sided coupling is a coupling of these two 
sequences such that for all n and i > n 

Dist({i? j } j<i |{i? i } i > i , {Uj}j>i) = Vtit({Rj}j<i\{Rj}j>i), 
and symmetrically 

Dist({l7 J -} i «|{J2,-} i > ij = Dist({C/ J } j<i |{C/ j } j > i ). 



If the sequences are stationary then the one-sided couplings that also are sta- 
tionary are called one-sided joinings. 
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If along with the endomorphism (X, T, fi) there is a generating function 
/ : X — > R we have defined Ri(x) = f{T l {x)) for any x and i > 0. We extend 
this by stationarity to negative i giving a stationary sequence Ri, i £ Z. (Note 
that the map from stationary measures on i? N to those on i?~ N obtained by 
extension to Z and then restriction is a weak* homeomorphism.) If we also 
have an endomorphism (Y, £, z/) and a generating function g : Y — > R we define 
(7j(y) = g(S l (y)) and extend to indices i < similarly. 

Definition 3.2. A coupling (or a joining) of (X, T, //) and (Y, S 1 , z/) is one- 
sided if there are generating functions / and g so that the coupling of Ri and 
Ui, i > 0, is one-sided. 

Notice that if a coupling generates a one-sided coupling for one choice of 
generating functions / and g it will do so for all choices. 

Lemma 3.3. For sequences of random variables Rj and Uj and each 
value i, those couplings for which 

Dist({i? i } i<i |{ J R j } i >i,{LT,} j > i ) = Dist({i? i } j<i |{i? i } i > i ) 

are a weak* closed subset of all couplings. 

Proof. We begin with some basic reductions of the problem. Notice that 
this statement is simply about couplings of three measure algebras X\ = 
Vj^Rj, X2 = V ' j>%Rj and Y = Vj>iUi where the first two are coupled by 
a fixed measure ji. We consider those couplings of the three where X± and Y 
are coupled conditionally, independently over Xi- Viewed this way we see that 
without loss of generality we can assume all the random variables R{ and U{ 
(to be explicit) are two-valued. Next notice that the condition reduces to the 
countable list of conditions 

Dist({i? i } / < i<i |{i? j } j > i , {Uj}j>i) = Dist({ J R i } / < j<i |{i? i } i >i). 

To show that each such condition is weak* closed is equivalent to proving 
closedness when X\ is a finite space, i.e. we have reduced to the case of showing 
closedness of Dist^A^, Y) = Dist(P|X2) where P is a finite partition. Now 
suppose Vi all satisfy this condition and converge weak* to v. Upper semi- 
continuity of entropy tells us that limsupi? !/i (P|X2 V Y) < H V (P\X2,Y). On 
the other hand 

H V .(P\X 2 V Y) = H^(P\X 2 ) = H U {P\X 2 ) > H U {P\X 2 V Y) 

and H U (P\X 2 V Y) = H^(P\X 2 ) and fj, must also satisfy the condition. □ 
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COROLLARY 3.4. Suppose [i\ couples Ui and Ri one-sidedly and {i<i cou- 
ples Vi and Ri one-sidedly. There is then a coupling (i of all three sequences 
that projects to fii on the appropriate pair of sequences and when restricted to 
Ui and Vi is one-sided. 

Proof. Suppose ~p couples the three sequences Ri, Ui and Vi for i > I 
with the given marginals in some way, not necessarily one-sidedly. Extend 
~p to index I — 1 by first coupling on Rj-i relatively independently over the 
algebra \J {>I Ri. By the one-sidedness of the marginals, this maintains the 
marginals. Now couple on Uj-i to this relatively independently over the al- 
gebra Vi>/-i R% v Vj>7 Couple on V1-1 symmetrically. This preserves the 
marginals and 



i.e., for this one step the coupling is one-sided. 

We continue inductively to add on variables as I — > — 00 and obtain a 
coupling which is one-sided at all indices i < I. For each value I > start 
with ~p, the relatively independent coupling of the fn and [12 over the common 
algebra Vz>j Ri- Extend to the right as described above to produce a measure 
~pj. Let fi be any weak* limit of the couplings Jlj. Lemma 3.3 guarantees that 
fi is one-sided. □ 

Definition 3.5. Suppose (X,T,fa and (Y, S, v) are two endomorphisms. 
Then define J + ( (X, T, fa) , (Y, S, v) ) to be the space of all one-sided joinings 
of these endomorphisms. Define J+((X, T, fj,), (Y, S, v)) to be the ergodic and 
one-sided joinings. If there is no confusion we will just write J + and J+. 

Notice that Lemma 3.3 has shown the one-sided joinings to be a closed 
subset of all joinings. It is not difficult to see that if T and S are assumed 
ergodic, then the one-sided joinings are convex and that the extreme points of 
this set are the ergodic and one-sided joinings. (One just notes that the ergodic 
components of a one-sided joining must themselves be one-sided as the ergodic 
decomposition is past measurable.) Furthermore we see that if jX\ G J + of T 
and S and fa £ J + of T and U then there is a stationary joining of all three 
which projects to these two on the appropriate pairs and is in J + of S and 
U on this pair. Just observe that Corollary 3.4 gives us a one-sided coupling 
and by averaging over translates and taking a weak* limit we get a joining. 
By extending this further, we see that if the fa were in J+, i.e. were ergodic, 
then the one-sided joining of S and U can also be chosen ergodic. Almost any 
ergodic component of the one-sided joining just constructed will do. 



Dist(C/ J _ 1 |{C/ i } i > / ,{^} i > / ) 



Dist(C/ / _i|{C/ i } i > / }), 



and symmetrically, 



Distal I {[/;},>/, {K}^,) 



Dist(F / _ 1 |{F i } i > 7 }); 
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We consider two different weak* pseudometrics on processes of the form 
((X, T, n),f). First an endomorphism (X,T,/j.) and a function / : X — > R 
define a measure on R N . Let dist be a metric on C*(R N ) which generates the 
weak* topology. When we refer to 

dist(((X, T, /.),/), ((Y, S, u),g)) = dist(/, g) 

we mean the distance between the measures that ((X, T, //), /) and ((Y, S, v), g) 
generate on We mention a particular case of this dist (pseudo)metric 

topology to be used repeatedly. If fi is a joining of (X, T, fi) and (Y, S, v) and 
/ : X — > R and g : Y — ► £/ then /} projects to a stationary measure on (i? x C7) N . 
In this case we use the notation 

dist(fVgJvg). 

A uniform p-to-one endomorphism (X, T, fi) and a function / : X — ► i? 
define a measure on i? r as we have associated with each point x & X the T, R 
name 7^. Let tdist be a metric for the weak* topology on Borel measures on 
R T . When we refer to 

tdist(((X, T, /.),/), ((Y, S, v),g)) = tdist(/, 5 ) 

we mean the tdist distance between the measures these processes generate 
on R r . 

Lemma 3.6. The (pseudo) topologies generated by dist and tdist on uni- 
form p-to-one endomorphisms and tree adapted functions to R are the same. 

Proof. By Lemma 2.3 we can assume / generates. We have seen any 
tree adapted T,R name h generates a sequence of random variables Ri(h), 
i.e. a measure on Thus any measure m on the tree adapted T, R names 

projects to a measure on R~ N . If this measure comes from a uniform p-to-one 
endomorphism then it is stationary and maps homeomorphically to a measure 
on Call this measure *(m). Obviously, * is a weak* continuous map 
which shows that tdist is at least as strong as dist. 

For a uniform p-to-one endomorphism (X, T, fx) and a.e. x, the map 
x — > T x lifted to measures is an inverse for ^> . 

To see that is continuous, suppose {Ri} and {Rf}, j = 1,2,..., are 
sequences of random variables (measures on arising from tree-adapted 
functions on uniformly p-to-one endomorphisms with {Rj} — ► {Ri} i n dist. 
This is equivalent to putting all these random variables on a common measure 
space (fi, /t) (which might as well be (i? N ) z x R z ) with each R\ — ► Ri in /} 
probability. 
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We write R = {Ri}i e z, R j = {R-Uiei and R+ = {-RjjieN etc. Let 
be the uniform p-adic tree of inverse images of R + and Tgj that of R 3 + . All 
these labeled trees have the property that the labels of the p predecessors of 
any node are precisely 1 apart in R. The collection of such R labelings of the 
tree are a closed subset C of R? . 

Let C n C R? n consist of the labelings in C restricted to T n . On R 7 " use 
the sup metric up to tree automorphisms and on R n the sup metric. 

A labeling n G C n gives rise to p n distinct names in R n - the names 
along the p n branches. Call this set of names N(n). The critical observation 
here is this: Suppose for two labelings rji and 7/2 G C n , each element of N(j]i) 
is within e < 1/2 of some element in N{j}2)- Then the labelings 7/1 and 1J2 
themselves must be within e. Just notice that the labels along distinct branches 
of 771 cannot be matched within e of the same branch of 7/2 and moreover the 
matching must preserve the tree structure. 

Both R n and R Tn are compact and so for each e > there is a closed 
subset G = G{e) with fi(G) > 1 — e 2 /p n so that the map R + — > n is 

uniformly continuous on G. Hence there are an iV and 5 so that if R + and 
R' + €G and d(R { , R' { ) < 5 for i < N then d{Tg + n ,T^ J < e/2. 

As R? — > R in probability, for all j large enough there will be a subset 
H = H(e,j) of values R? with fi(H) > 1 — 2e/p n and for each R? £ H there is 
a representative value R{R?) with R + {R?) G G so that 

i) for -n < i < N we have d{R{,Ri(R?) < e/2 and 

ii) E(R + G G and d(Ri, Ri{R j )) < 5, l<i< n\R j ) > 1 - e. 

Let H' Q H consist of those R? for which all p n extensions of R\ in 

N(TL ) o R J + intersect H nontrivially, i.e. have good representatives. Now 

+ 

ju(iJ') > 1 — 2e and for any i2^_ G i?' all names in NiTlj ) must be within s 
of some branch of ^ + ^) ra and by our observation above 

d ( T R J + ,n' T R+(^),n) < £ " 

Now, by ii), for i# G i7' we have 

E(d(T^,T R+ ) < 2e\R j ) >l-e 

and so 

E ( d Cr^ n ,T n+n )<2e)>l-3e. 
We conclude that ^ J —>■ R in tdist. □ 
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This next lemma is important because it says that all of the joinings 
created in the next section are one-sided. 

Lemma 3.7. Suppose (X,T,fx) is a uniform p-to- one endomorphism and 
the factor map <p to (Y, S, v) is tree adapted. Then the joining fi of (X, T, li) 
and (Y, S, v) generated by <p is one-sided. 

Proof. Fix x and y = 4>(x). By stationarity it is sufficient to show that 
the conditional probability of preimages of y is the same as the conditional 
probability of preimages of y given x and y. As (p is a bijection from the inverse 
images of x to those of y the conditional mass of any T v (x) given x must be 
precisely that of S v (y) given y. This value is p~^ and hence the conditional 
expectation of each S v / (y) given x and y is the same as its expectation given 
y. The other set of equalities is obvious as x determines y so that conditioning 
on x and y is the same as conditioning on x. □ 

Definition 3.8. Suppose (X, T, li) and (Y, S, v) are two uniform p-to-one 
endomorphisms and / and g are functions into the same metric space (R, d) 
and n > 0. Now, 

i n (f,g) = i n (((X,T,fi),f),((Y,S,is),g)) 

= j d{f{T{x)),g{S\y))dj^ 

and 

tn{f,9) = tn(((X,T, M ),/),((y,S, !/),(/)) 

= inf (^J i n (%,T y )dp)j , 

where the in/'s are taken over all one-sided couplings fx. For comparison's sake 
we include the definition 

dn(f,9) = d n (((X,T,fi),f),((Y,S,u),g)) 

= inf (^E/ d(f(r{x)) jg (S\y))d^ , 

where the inf is taken over all couplings fx. 

On the face of it, i n is a metric on random variables indexed on — n < i < 
and t n on random variables indexed on < i < n. By stationarity of Ri these 
can be translated to be the same sets of random variables. The proof of the 
following lemma will be given later in the section. 

Lemma 3.9. Suppose (X,T,li) and (Y,S,v), are uniform p-to- one endo- 
morphisms with tree adapted functions f and g to R. Then i n (f,g) = t n (f,g). 
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We will not use the notation t n again except in the proof of Lemma 3.9, 
using just t n for both notions. 

Definition 3.10. Suppose (X,T,fi) and (Y, S, v), are uniform p-to-one 
endomorphisms and / and g are functions with values in R. We set 

t{f,9) = luninf 

Lemma 3.11. Suppose (X,T,fi) and (Y,S,v) are two uniform p-to- one 
endomorphisms and f and g are functions to R. There is then a stationary, 
ergodic and one-sided joining, (i, with 

t(f,g) = J d(f(x),g(y))d f i. 

In particular the lim inf in the definition is a limit. 

Proof. The simple weak* compactness argument completely analogous to 
that for d works here as the set of one-sided couplings is closed and convex, and 
the extreme points of the stationary and one-sided couplings are the ergodic 
ones. □ 

Definition 3.12. Suppose (X, T, (i) and (Y, S, v) are two endomorphisms, 
/ : X — > R is tree-adapted and ft £ J+ is an ergodic one-sided joining of them. 

e - 

We say / C Y if there is a one-sided and tree-adapted function / :Y — > R with 

A 

J d{f{x)J{y))djl{x,y) < e. 
Now, / c Y if / C Y for all e > 0. 

e - 
Notice that if / C Y one immediately obtains that for / of the definition 

A 

i(((X,T,v,),f),((Y,S,v),f))<e. 

Also notice that if / generates and / C Y then relative to the endomorphism 

A 

(X,T,n) sits as a one-sided and tree-adapted factor of (Y,S,v). 

Now we show that a one-sided coupling lifts naturally to a measure on 
X x Y x A. This is the essential ingredient for showing that i and t are equal 
and is also necessary in the copying lemma. This lift is not unique. (In the form 
we now describe, the direct product of two uniform p-to-one endomorphisms 
has many potential lifts to a third automorphism coordinate.) Let h and h! 
be R- valued and [/-valued tree names and Ri(h) and Ui(h'),i < 0, be the 
sequences of random variables they generate. For any automorphism A we 
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construct a joined name h a(v) = {h(A 1 (v)) , h' (v)) . Such a name will project 
to a measure on i?~ N x U~ m that is a one-sided coupling of Ri(h) and Ui(h'). 
Call it A(h,ft',i4)- 

Lemma 3.13. Suppose (X,T,(i) and (Y,S,v) are two uniform p-to- one 
endomorphisms, x £ X , y 6 Y are two points, and f : X — > i? and 5 : Y — > U. 
The one-sided couplings of the form (i(T x ,T y ,A)i A £ A, are the extreme points 
of the one-sided couplings of Ri(T x ) and Ui(T y ) and any one-sided coupling ft 
of Ri and Ui, i < 0, is of the form 

J V(T x ,T y ,A) dm (x, y ){A)djl{x,y) 
for some family of probability measures rn^ x ^ on A. 

Proof. We only need to show that any one-sided coupling ~p of variables 
of the form Ri(h) and Ui(h') is of the form 

V- = j fr{h,h',A) dm(A) 

for some measure m on A. The proof is by induction. We first show this 
for a single variable. This is equivalent to showing that any self-coupling of 
uniform measure on {0, . . . ,p — 1} is an average of measures supported on 
graphs of permutations. To see this suppose /to is such a self-coupling of 
{0, . . . ,p — 1}. The knowing relation on {0, . . . ,p — 1} x {0, . . . ,p — 1} given 
by fio(i,j) > satisfies the hypotheses of the Hall marriage lemma and hence 
there is a bijective subrelation; i.e., /to = aft n + (1 — cc)/ti with a > and fi n 
supported on the graph of the permutation ir. Repeating this for fii and so on, 
we obtain a representation of the measure as an integral of measures supported 
on graphs of permutations. Using the one-sidedness of Ji we complete the result 
inductively as we conclude that jx is written uniquely as an integral of couplings 
which node by node sit on the graphs of permutations applied at each node, 
i.e. sit on graphs of tree automorphisms. □ 

We are ready to show that t and t agree. 

Proof of Lemma 3.9. We begin once more noting that by translating the 
random variables Ri and Ui by n, both t n and i n are calculated as infima over 
couplings of variables indexed on — n < i < 0. We have already noted that 
a pairing of the nodes of T n by a tree automorphism A when viewed on the 
names along the branches of the tree gives a one-sided coupling £i(h,h',A) °f the 
distributions of names. This is enough to conclude that t n < i n . As a one-sided 
coupling of Ri(%) and Ui(T y ) can be written as an integral over X x Y x A 
of couplings fi(T x ,T y ,A) we see the other inequality t n < t n . □ 
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4. Tree finitely determined 

Now that we have discussed the theory of one-sided couplings we are ready 
to define tree finitely determined. This will play a major role in the proof of 
the copying lemma. 

Definition 4.1. We say ((X,T, fi), f), where (X,T,n) is a uniform p-to- 
one endomorphism and / : X — > R is tree adapted, is tree finitely determined 
(tree f.d.) if for every e > 0, there is a 5 such that for any endomorphism 
(Y, S, fi) with function g : Y — > R with tdist(/, g) < S then t(f, g) < e. 

If / is strongly tree adapted then tdist here can be replaced with dist. 
This will enable us to work with strongly tree adapted functions. Now we 
need the following lemma. Later, as we will see, tree f.d. and tree v.w.b. are 
equivalent. We will see that all tree-adapted factors of a tree f.d. map are tree 
f.d. At this point we need something substantially less. 

Lemma 4.2. Suppose ((X,T, //), /) is tree f.d. where f is a generator. 
Then for any bounded map h : X — > R, ((X, T, fi), f V h) is tree f.d. 

Proof. This argument follows well established lines by approximation of 
h by a "finite code". Each successive step simply requires a closer match in 
tdist. To begin, as / is a generator, h can be approximated arbitrarily well in 
L 1 ( / u) by maps of the form 

H(f(x),f(T(x)),...J(T N (x))) 

where H is a continuous map from R N+1 — > R and N is finite. If ((Y,S,u), 
g V h!) is sufficiently close in tdist to ((X, T, /x),/ V h) then H(g(y), 
g(S(y)), . . . , g(S N (y))) will of necessity also be a good L 1 ^) approximation 
oih'. 

Now if ((Y, S, v),gVh!) is close in tdist to ((X, T, /i),/V/i) then ((Y,S,v),g) 
is close in tdist to ((X, T, /x), /) but is not necessarily tree-adapted. As / is tree- 
adapted though g must separate inverse images of most points, so some pertur- 
bation g' of g which agrees with g on most of Y will be tree-adapted. If we are 
close enough in tdist then we will still have H(g'(y),g'(S(y)), . . . ,g'(S N (y))) 
a good approximation to h! in L l (v) and ((Y, S,u), g') close in tdist to 
((X, T, n),f), hence close in t. Now if ((Y, S, v),cf) and ((X, T, u), f) are close 
enough in t (notice that how close can be set after the value N and contin- 
uous map H are fixed) then ((Y, S, v), g' V H(g', g' o S, . . . , g' o S N )) will be 
close in tdist to both ((Y, S,u),gV h!) and to ((X, T, fi), f V /i) which is to say 
((X,T,n),f Vh) is tree f.d. □ 

Corollary 4.3. Any tree f.d. process has a tree f.d. generator that is 
strongly tree adapted. 
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Proof. Choose h in Lemma 4.2 to be a map to {1, 2, . . . ,p} that separates 
inverse images. □ 

Lemma 4.4. If ((X,T, /x), /) is tree v.w.B. then it is tree f.d. 

Proof. Suppose ((X,T, fi), f) is tree v.w.B. Given e > choose an n so 
that there exists a subset Xq C X of measure > 1 — e and a fixed 7^,i? 
name h n so that t n (T x , h n ) < e for any x G Xo. For each i G X let Ar be 
an automorphism that realizes the minimum in the definition of i n (T x ,h n ). 
Using Lemma 3.6, choose a 8 > so small that if (Y, S 1 , zv) is a uniform 
p-to-one endomorphism, g is a function to i?, and dist(/, g) < 8 then there 
exists a subset Yq C Y of measure greater than or equal to 1 — 2e such that 
^(Tj,, /i n ) < 2e for all y G Y . 

Consider aT,E name /i constructed by tiling T with copies of /i n . More 
precisely, for any v such that \v\ = jn for some j and any i/ G rj n let h(vv') = 
h(v'). For each x G X we will inductively construct an automorphism A 
which will show % and h are close in t. The matching we use is a greedy 
algorithm matching n levels at a time. For each v G r) n let A(v) = A x (v). Now 
assume A has been defined on all v G rjj n . For each v G rjj n and u' G r/ n let 
A(vv')=A(v)A Ta(v){x) (v'). 

Now we calculate tk n {%, h). Let Gk n {x) be those nodes with |u| = jn for 
some < j < k and r„(x) G Xq. Let M(Gk n (x)) be the sum of p - '"! over all 
v G Gkn(x)- This construction leads to the calculation: 

t k n(%, h)<l- M{G kn {x))/k + sM{G kn {x))/k. 
The fact that T is measure-preserving implies 

J M(G kn {x))dn{x) = kn(X ) > k(l-e). 

Hence for all but y/2e of the x G X, 

t kn (%,h) < V2l. 

Precisely the same argument applied to Y yields that for all but of the 

iknVy, h) < lyfl. 

We conclude that 

t(f,g) = liminf t n (f,g) < A^fe 

n — »oo 

which ends the proof. □ 

Corollary 4.5. The standard endomorphism with the usual p set inde- 
pendent generating partition is tree v.w.B and hence tree f.d. 
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Proof. It is trivial that with this partition every point b has the same past 
tree name. Hence it is tree v.w.B. □ 

We postpone the converse of Lemma 4.4 as we will use the one-sided 
conjugacy theorem to prove it. 

5. The copying lemma 

In this section we prove the isomorphism theorem for uniform p-to-one en- 
domorphisms. First we prove a Rokhlin lemma, then a strong Rokhlin lemma. 
Next, we prove the copying lemma. The isomorphism theorem will follow eas- 
ily from the copying lemma. Notice that the strong Rokhlin lemma is proved 
only for finite valued / : X — > R; i.e., / is a partition. To emphasize this we 
refer to partitions P and Q instead of functions / and g. 

The first step is to prove a Rokhlin lemma for uniform endomorphisms. 
This result has appeared previously in the work of Rosenthal [11]. We present 
a proof here as his is perhaps too brief. 

Definition 5.1. Let (X, T, fi) be a uniformly p-to-one endomorphism and 
T v be some choice for the partial inverses of T. AT n Rokhlin tree is a collection 
of disjoint sets B v C X, v G rj n , with the property that T V (B$) = B v . 

Theorem 5.2. Let (X,T,/j,) be a uniformly p-to- one and ergodic endo- 
morphism and T v be some choice for the partial inverses ofT. For each n > 
and e > there exists a T n Rokhlin tree B v with fj,(L) v B v ) > 1 — e. 

Proof. For any set C and n > define 

B ih = {x\ min{i > : T\x) G C} = mod (n + 1)} \ (U < i <„T i (C7)). 

Suppose x G Bfjj n T V (B$) for some Then since Uo<j< ra T _l (a;) n C = 

for any point x 6 Bq, 

min{i > : T\T v (x)) G C} — min{i > : T\x) G C} = |u|. 

Both of the terms on the left-hand side cannot be equal to mod (n+ 1) unless 
\v\ = 0. Thus B® n T„(5 ) = for any < \v\ < n which implies B v = T V (B$) 
forms a Rokhlin tree. 

Since /j,({x \ (minj>oT l (x) G C) = i}) is nonincreasing we have 

A*(B ) > l/(n + 1) - (n + l)p n+1 n{C) and ^(U„S„) > 1 - (n + 1)V + V(C7). 

This last term can be made as small as we like by choosing //(C) small. □ 

Now we prove a strong Rokhlin lemma. This says that the top level B$ 
can be chosen independently of any partition. 
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Lemma 5.3. Suppose (X,T,[i) is an ergodic uniformly p-to- one endo- 
morphisms and P is a finite partition of X. For any e > and n there is a 
T n - Rokhlin tree C v so that 

H (U ver)n C v ) > 1 - e 

and 

Dist(P) = Dist(P|C7 ); 

i.e., P and C$ are independent. 

Proof. Given e choose m > 4(n+ l)/e. Let Bq be the top of a T m Rokhlin 
tree with U v&Vm B v > 1 — e/2. For each 

Ae (V v evMP))\B 
write A = U^Ai with each fx(Ai) = fi(A)/(n + 1). Now let 

c* = u A6(v ^^ T , (P))|fl0 u-ir i )- 2 u ? r-^ 1 )-(A). 

Then Cq forms the top of a T n Rokhlin tree with \J ver)n C v > 1 — e. We also 
have 

Dist(P | U^o"" 1 ^^)) = Dist(P | C ). 

Thus for any element Pi of P 

M^nCj) > (l-e) f i(P i )/(n + l). 

Then we pare down C$ to C'^ so that for every element Pi of P we have precisely 

fi(Pi n C' % ) = (1 - e)n(Pi)/(n + 1) 

and we are done. □ 

Now we are ready to prove the copying lemma which is the main element 
in the proof of Theorem 5.5. 

Lemma 5.4 (copying lemma). Suppose (X, T, fi) and (Y, S, v) are uniform 
p-to-one endomorphisms with finite tree adapted functions f and g and fi G 
an ergodic joining. For any e > there is a tree adapted function f : Y — > R 
with 

dist(f$g,fVg) < e. 
As a consequence, if Ri is tree f.d. then 

O e , = {fi£j+\fcY} 
is an open and dense subset of J+. 
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Proof. First we show the result for finite partitions P and Q instead of 
functions / and g. The definition of dist gives an n and a 5 such that if 

£ \u{A)-fa(A)\ <1-S, 

Ae\/^(TxS)- i (P\/Q) 

then 

dist(P VQ,PVQ) < e. 

For any partition P and any m define a new partition P m so that x and x' are in 
the same element of P m if and only if i m (T x ,T x i) = 0. Choose m > 4n/5. Build 
T m Rokhlin trees B v C X and C v C Y so that fi(U v B v ) = v(U v C v ) > 1 - <5/2, 
f?0 is independent of P m , and C0 is independent of Q m . 

Next define a measure-preserving map /t : B$ — > C0 so that the measure 
/i generates on i?0 x restricted to P m V Q m is the same as the measure jl 
restricted to P m V Q m . This is possible because Bq is independent of P m , and 
C0 is independent of Q m . 

As ft is one-sided we can write it as 

A = J V(T x ,T y ,A) dm {Xty) dfi(x,y). 

Lift jl to X xY x A as 

A = y 4 x <5 y x m^^) dp,(x,y). 

Now P m x Q m x An is a finite partition of X x Y x A. As Qj is independent 
of Q m we can extend 1/ on C$ to P on C$ x A to be identical in distribution on 
Q m x A m to ft (when normalized) . We can now construct a measure-preserving 
map h : (Bid, fi) — > (Qj x .4, so that the normalized measure supported on 
the graph of h restricted to P m x Q m x A m agrees in distribution with Jl. 

Now we are ready to define the new partition P. Write h(x) = (h(x),A(x)). 
For each and every v £ T m set 

P(T A[x)(v) h(x)) = P(T v (x)). 

On the rest of the space define P in any way such that P is tree-adapted. 
Now for any set A G Vft(T x S) _i (P V Q), 

v{A n (u M<m _ n c w )) = A(^4 n (u M<m _ n s w )). 

Since A(U|„| <m _ n B„) = ^(U|„| <m _ n C„) > 1 - 5 this implies 

x: KA)-A(^)i<i-<5- 

Aev^(TxS)- l (PvQ) 

Thus . 

dist(PvQ,PvQ) < e. 

This completes the proof of the first statement for finite partitions. 
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To extend the result to strongly tree-adapted functions do the following. 
Partition R into sets of small diameter. Choose one representative point in 
each element of the partition. Define F : R — > R to map all points in each 
partition element to its representative point. It follows that if the partition is 
fine enough then dist(/, Fof) will be small. Do the same with g, i.e. construct 
a finite valued G : U — > U. Once more if the partitions are fine enough then 
we will have 

dist(f\/g,Fof\/Gog) < e /3. 

As / and g are strongly tree-adapted, if the partition elements are less than 1 
in diameter Fof and Gog will also be strongly tree-adapted (by the discrete 
metric on their finite ranges). We can now apply the finite partition version 
proven above to F o f and G o g to construct / with 

dist(F o f\/Go gJvGo g) < e /3, 

knowing G comes from a fine enough partition, independent of the choice of / 
that 

dist(/VGo 5 ,/v 5 ) <e/3. 
For the second result notice that the previous equation implies 

dist(/, /) < e. 

For any S if ((X, T, //), /) is tree f.d. then e and / can be chosen so that 

i(fj)<5. 

For any 5\ if 5 and e' are small enough then by Lemma 3.4 we can extend v 
to a one-sided and ergodic joining p, with 

dist(/V g, fVg) < S 1 and f CY 

where O e > is dense. Openness follows easily from the definition. □ 

Theorem 5.5. A uniform p-to-one endomorphism (X,T,fi) is one- 
sidedly conjugate to the standard uniform p-to-one endomorphism (B,a,i>) if 
and only if there exists a generating function f so that (X, T, //) and f are tree 
v.w.B. (or equivalently tree f.d.). 

Proof. Let / be a strongly tree adapted function from X to R and K, the 

standard independent generating partition of B. We know is a Gg subset of 

J + in the weak* topology. Since (X, T, /i) and / are tree v.w.B. they are also 

tree f.d. Thus the copying lemma tells us that the O e are open and dense in 

. Intersecting over e = 1/n, we see that the Baire category theorem shows 
o 

that the set of ft with / C B is a residual subset of Jj. Let K be the standard 
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independent generating partition of B. As the standard example is also tree 

o ' , 

v.w.B. and tree f.d., the set of fi with K C X is a residual subset of Jj. Thus 

the set of ft with K C X and / C B is nonempty and (X, T, /i) and (B, a, v) are 

A A 

isomorphic. 

Notice that we need only assume / tree adapted here and not necessarily 
strongly, as we can extend / to an / V h which is still tree f.d. and now is 
strongly tree adapted and hence isomorphic to the standard example. 

All that remains is to show tree f.d. and tree v.w.B. are equivalent. We 
already know that tree v.w.B. implies tree f.d. For the other implication, we 
have just seen that tree f.d. implies one-sidedly conjugate to the standard action 
which is tree v.w.B. We have also seen that tree v.w.B. descends to one-sided 
and tree adapted factors. Hence any uniform p-to-one endomorphism which is 
isomorphic to a tree v.w.B. endomorphism is tree v.w.B. Thus tree f.d. implies 
tree v.w.B. □ 



6. Examples of tree v.w.B. skew products 

We will show now that general classes of isometric extensions of standard 
endomorphisms are all one-sidedly Bernoulli. Among these will be the [T, Id] 
endomorphisms, where T is an irrational rotation. These were described in the 
first section. 

Throughout this section we consider p to be fixed. Remember that (B, a, v) 
is the standard uniform p-to-one endomorphism. We also fix (Z, d) a compact 
metric space with 1 its space of isometries. We assume 1 acts transitively, i.e. 
Z is a homogeneous space, and has Haar measure. We put on I the uniform 
topology. Given a function / : B — > X we construct the cocycle extension Tf 
acting on B x Z by 

T f (b,z) = (a(b),f(b)(z)). 

The map a has a natural set of partial inverses a v . These extend to form 
a natural set of partial inverses (Tf) v . Set c m = supi„| =m (diam(/((7„(B)))). 

Definition 6.1. We say / generates a summable cocycle if J2m c m < 

Notice that any / depending on only finitely many coordinates bo, . . . , b m 
automatically generates a summable cocycle. Thus the [T, Id] endomorphisms 
generate summable cocycles. 

Lemma 6.2. Suppose f : B I generates a summable cocycle. Then 
for any e > there exists 5 such that if d((b, z), (6', z')) < 5 then d((Tf) v (b, z), 
(T f ) v (b' , z')) < e for all v G r). 
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Proof. Given e there exists no so that J2 n >n c « — e /^- Choose 5 < e/2 
such that d(b, b') < 5 implies that bi = b\ for all < i < uq. By the definition 
of c if bi = b\ for all < i < no then 

d((T f ) v (b, z), (Tf) v (b', z')) < d((b, z), (b', z')) + ]>>„<£. □ 

n>no 

Lemma 6.3. Suppose f : B — > Z generates a summable cocycle and Tf 
is weakly mixing. For any e > there is an N > so that (Tf)~ N (6, z) is e 
dense in B x Z for all (b, z) . 

Proof. From [12] we know that if Tf is weakly mixing then it must be 
v.w.B. and hence K. The fact that Tf is a if-system implies that there exists 
an N' such that for most b and all z we have that (Tf)~ N (b,z) is e/2 dense 
in B x Z. In particular this holds for one b. Lemma 6.2 implies that there 
exists N" such that if 6; = b\ for all < i < N" then (Tf)~ N '(b', z) is e dense 
in B x Z. For every b" there exists b G 6" such that 6« = 6« for all 

< i < N". Thus (Tf)~ N '~ N " (b' , z) is e dense in 5 x Z. □ 

Theorem 6.4. Suppose that f generates a summable cocycle and that Tf 
is weakly mixing. Then Tf is tree v.w.B. 

Proof. Given e we get a 5 from Lemma 6.2 and an iV from Lemma 6.3 
which implies that (Tf)~ N (b, z) is 5 dense in BxZ for all (b, z) G BxZ. For any 
(6, z), (&', z') € B x Z we define the tree automorphism A that pairs % h z \ with 
Tib',z') inductively. We pair at least N levels at a time. If d({b,z), (b',z')) < 5 
then by Lemma 6.2 we are done. Otherwise Lemma 6.3 implies that there 
exists a tree automorphism An which pairs at least one preimage of (6, z) with 
a preimage of (b ; , z 1 ) so that the pair is within 5. 

Now suppose we have defined A up to at least level kN . We will extend 
it to at least level [k + 1)N. If A(vv') has been defined for all v ' 6 r] then we 
need to do nothing. If it has not and d((Tf) v (b, z), (Tf)^ v ^(b' , z')) < 5 then 
extend it by the identity automorphism. By this we mean, for all v' £ r), that 
A{vv') = A{v)v' . If neither of the above conditions is satisfied then we use 
Lemma 6.3 to tell us how to define A{vv') for all v' £ n^. 

Choose k so that (1 — \/p N ) ek < e. By the previous paragraph and 
Lemma 6.2 we know that for each n > ekN the fraction of preimages that 
is paired within e is at least 1 — e. Thus 

t kN ((b, z),{b',z')) <3e 

and Tf is tree v.w.B. □ 

As well as the [T, Id] examples, our methods cover the following smooth 
endomorphisms. Replace B by x — ► 2x on R/Z and let Z = R/Z as well. Set 
f(x) to be any Holder function to R/Z that is not a coboundary (for example 
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f(x) = sin(27rx)) giving a smooth action on the 2-torus. That / is not a 
coboundary means Tf is a weakly mixing action and that / is Holder implies 
that it generates a summable cocycle. Thus we conclude that such an action 
must be tree v.w.B. It would be interesting to know if there can be a smooth 
and uniformly p-adic action that is v.w.B. but not tree v.w.B. Our work here 
shows it will not be found among the isometric extensions of x — > px mod 1 
with summable cocycles. 
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